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Abstract 

We show how to enlarge the covariance group of any classical field the- 
ory in such a way that the resulting "covariantized" theory is 'essentially 
equivalent' to the original. In particular, our technique will render any clas- 
sical field theory generally covariant, that is, the covariantized theory will 
be spacetime diffeomorphism-covariant and free of absolute objects. Our re- 
sults thus generalize the well-known parametrization technique of Dirac and 
Kuchaf . Our constructions apply equally well to internal covariance groups, 
in which context they produce natural derivations of both the Utiyama min- 
imal coupling and Stiickelberg tricks. 
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1 Introduction 



The Principle of General Covariance has had a long (and somewhat checkered) 
history in physics (Norton [1993]). In fact, this principle is often taken to mean 
slightly different things and it is unclear what the physical ramifications of 
various formulations are. Anderson [1967] contains a careful exposition of the 
issues surrounding this idea. 

Here, motivated by Anderson [1967] and Kuchaf [1988], we adopt the fol- 
lowing precise definition: A Lagrangian field theory is generally covariant 
provided (i) the Lagrangian density is equivariant with respect to the action of 
spacetime diffcomorphisms, and (ii) all fields are variational. General covari- 
ance is of course a desirable property for a physical theory to possess, and it is 
quite useful when investigating classical field theories from both structural and 
computational points of view (Gotay and Marsden [2010], Leok [2004]). 

Unfortunately, there are various ways in which general covariance can fail 
to hold. The most obvious is that the Lagrangian density L just does not have 
the required behavior under changes of coordinates; for instance, for the 1 + 1 
Klein-Gordon equation 

1 , 

9,xt + 2 m 9 = 

with mass m on R 2 , we have 

£ = (^>,t9,x - T^m 2 ^ dt A dx (1.1) 

which is visibly coordinate-dependent. 

A straightforward way to repair (1.1) is to rewrite it as 

£ = \ {sTMr - ™ V) V-det 5 d 2 x (1.2) 

where g is the Minkowski metric. Now we have Diff(K 2 )-equivariance, but at 
the price of introducing the (nonvariational) field variable g. 1 

This brings us to another way that a theory may fail to be generally co- 
variant: it may contain an "absolute" or "background" object in the sense of 
Anderson [1967] (see also Post [2007]), in which case (ii) cannot be satisfied. In 
(1.2) g is an absolute object ("prior geometry," in the terminology of §17.6 of 
Misner et al. [1973]); as long as g is immutable, the theory is still not generally 

1 Introducing a metric is not the only way of making (1.1) geometric; we shall give another 
one later. 
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covariant, despite the fact that the Klein-Gordon equation, which now takes 
the form g 1 * 1 '4>.^ v + m 2 cf) = 0, is certainly an invariant statement. 2 

However, if we insist that g be dynamic, then varying it in (1.2) yields 

= «=^> I"" = 0, (1.3) 

where T is the stress-energy-momentum tensor density of the Klein-Gordon 
field. This forces cj> = 0, so that we have produced a system which is not 
equivalent to that with which we started. To undo this problem, we must 
modify the Lagrangian once again, possibly by adding in a source term for the 
metric g (e.g., regard g as gravity). But in doing so we would likely still obtain 
an inequivalent (albeit consistent) system. Failing this we cannot allow g to be 
variational. 

So even in this basic example, we have two starting points based on: (1.1), 
which is not spacetime diffcomorphism covariant but in which all fields are 
variational, and (1.2), which is the other way around. Thus goals (i) and (ii) 
are more difficult to accomplish — simultaneously — than might be expected. 
And clearly we need to be more subtle than simply "turning background fields 
on." 

There are other contexts in which similar issues arise. We may have a field 
theory which does not have a desired covariancc property (not necessarily related 
to spacetime diffeomorphisms) ; can we modify it in such as way as to attain 
covariance while maintaining equivalence with the original system? A well- 
known example of what we have in mind is provided by gauge theory. In this case 
we have a Lagrangian which is invariant under the action of a finite-dimensional 
Lie group, and we would like to enlarge the invariance group to the full gauge 
group. This can be accomplished by means of the Utiyama minimal coupling 
trick which, however, necessitates introducing a (nonvariational) connection. 

In this paper we show how to solve these and similar problems by systemat- 
ically introducing new dynamic fields into the formalism; basically, subject to 
a few mild assumptions, we prove that we can enlarge the covariance groups of 

2 Thus the Principle of General Covariance does not simply mean that "the laws of physics 
must be expressible as geometric relationships between geometric objects," a point emphasized 
by Misner et al. [1973], §12.5. 

3 Kretschmann [1917] showed that it is always possible to make any theory spacetime 
diffcomorphism covariant by introducing auxiliary variables. As stressed by Dirac [1951,1964] 
and Kuchaf [1988], the catch is to do this while keeping all fields variational, without materially 
changing the theory. 



3 



field theories as desired while leaving solution spaces essentially unchanged. 4 In 
particular, our constructions provide a prescription for making any field theory 
generally covariant, as well as lead to natural derivations of both the Stiickelberg 
Lagrangian and the Utiyama minimal coupling trick (conveniently understood) 
as special cases. 



2 Setup 

As usual in classical field theory we start with a configuration bundle Y X 
whose sections, denoted 0, are the fields under consideration. The dimension of 
X is taken to be n + 1, and we suppose that X is oriented. 5 Let 

£ : J r Y -> A n+1 X 

be an r th -order Lagrangian density, where J r Y is the r th jet bundle of Y and 
A n+1 X is the space of top forms on X. Loosely following the notation of 
Castrillon Lopez, Gotay and Marsden [2008a], we use {x^,y A , y A l _ ll , ■ ■ ■ y A ^ 1 ...^ r ) 
as coordinates on J X Y . In these coordinates we write 

L=L(x»,y A ,y A ^...y A ^ r )d n+1 x. 

Evaluated on the (r th ) jet prolongation of a section <fi, the Lagrangian density 
becomes a function of {x 1 *, 4> A ,4> A ,/iu • ■ • 4> A ,/ii.../i r )i where 

<\> A :=y A odp and 4> A := ° f<t> = g^f..^. 

for 1 < s < r; we shall abbreviate this when convenient and simply write L(j r (f>). 

Suppose we have a group S which acts on Y by bundle automorphisms, so 
that we have a monomorphism S — > Aut(y). (We will often blur the distinction 
between S and its image in Aut (Y).) Being an automorphism of Y —5- X, each 
cry g S covers a diffcomorphism of X which we denote by ax ■ 

We say that a Lagrangian field theory is S- covariant if the Lagrangian 
density L is equivariant with respect to the induced actions of 9 on J r Y by 
prolongation and A n+1 X by push-forward: for all ay € 5 and 7 G J r Y, 

£tf*W(7)) (7)]. (2-4) 

4 In a precise sense to be elucidated below. 

5 Often X is a genuine spacetime, but in principle it need not have a Lorentzian structure 
(e.g., topological field theories). 
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A key fact is that a covariance group acts by symmetries: it maps solutions 
of the Euler-Lagrange equations to solutions. This is established in §4.2 of Olvcr 
[1986]. 

3 General Covariance 

General covariance is concerned with the action of the diffcomorphism group of 
the base X. Since Diff(A) is not necessarily a subgroup of Aut(Y), we must 
assume that there exists a group embedding Diff(X) — > Aut(Y) covering the 
identity denoted a i— >• ay. (Such embeddings typically exist. For instance, for 
tensor theories, such an embedding is given by push-forward. See Kolaf ct al. 
[1993] and Janssens [2009] for more information.) We suppose that the fields <f> 
have differential index k, 6 so that cry depends at most on the fc th -order deriva- 
tives of ax- (Again, this is usually the case. Tensor fields in particular have 
index < 1.) 

If (2.4) holds for all a 6 Diff(A), then we say that the field theory is dif- 
feomorphism covariant. 

3A The Variational Case 

We begin by tackling the case when a given classical field theory is not dif- 
fcomorphism covariant at the outset, while assuming that all fields originally 
present are variational. By combining the ideas of Dirac [1951,1964], as further 
developed in Kucha? [1973,1988] and Isham and Kuchaf [1985], with notions 
from elasticity theory (Marsden and Hughes [1983]), we "covariantize" the field 
theory under consideration by explicitly including diffcomorphisms of X as gen- 
uine fields. This yields in particular an intrinsic reformulation of the idea of 
treating coordinate changes as fields ( "paramctrization" ) . 
To accomplish our goal, we: 

1. Introduce new fields, the covariance fields. These are just diffeomorphisms 
T] : X — > X reinterpreted as sections of the product bundle X 2 — > X. 

2. Replace Y and J r Y by Y = X 2 x x Y and J k+r Y. Coordinates on J k+r Y 

6 The "differential index" of a field is the highest order derivative that appears in its 
transformation law under diffcomorphisms of the base X, cf. Gotay and Marsden [1992]. 
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arc 



(x^ , x ,...x fj, 1 ...fj, k+r , y ^ - - - 7 2/ fi...t/ r )' 
3. Replace the Lagrangian density £ : J r Y -> A n+1 X by £ : J fe+r Y 



A n+1 X denned by 

£(/ +r (r;, 0)(a:)) = rf [£{f(r,Y ° ° V^M*)))] 



(3A.1) 



In general L will depend on only the r-jet of <j), but on the (k + r)-jet of r\ as ^ 
has differential index fc (whence 7/v will depend upon j rf). But it may happen 
that the order of L is < k + r. 

Suppose for instance k = and r = 1. Then in charts we have 

x a , x%, y A , y A v ) = L(x m ,r, A , {r 1 A B y B p + V A P )x P n) (det J), (3A.2) 

where we abbreviate r\ A = y A o 7yy and the Jacobian of the 'coordinate change' 
AT — > X is J = (x a with J -1 = (a^a)- Here we have used the chain rule to 
compute 



V A a(jHvY o(f> or] 1 ){x)) := 



d(rjY ° 4> ° V ) 
9x a 



i\4 



d(y A oi lY ) 



(0( ?7 - 1 (x))) 



9(a; M or/ x ) 



(•r) 



or simply 



y A a = (^By^ + ^j/a 
for short. Similar formula: hold for other values of k and r. 



(3A.3) 



Remark 3A.1. Note that because of (3A.3) the covariance fields derivatively 
couple in the Lagrangian density. ♦ 

Remark 3A.2. This construction is reminiscent of the "material" or "body" 
representation in elasticity theory. In this context, X plays the role of the 



7 Henceforth we use a tilde to denote objects in a covariantized theory from those in the 
original theory when they differ. We use bars to notationally distinguish the factor X R3 X in 
the fiber of X 2 (and quantities defined thereupon) from the base X (and quantities defined 
thereupon) when confusion may arise. We further use lowercase greek indices for coordinates 
on the base, and lowercase latin indices for coordinates along the fiber; when we refer to the 
same coordinate on X and X we use corresponding letters for indices, e.g. x M and x m . 
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body manifold with the x^ being body coordinates, while X that of the space 
manifold with the x a being spatial coordinates. A diffeomorphism rj : X — > X 
is a deformation, and the tangent map Tr\ is the deformation gradient. For a 
deformation-field pair (77, cf>) : X — )■ Y, the spatial field multivelocities 

A (-if , -1^ d(y A on Y O0O7T 1 ) 
V a{3 {vyo^cj )):= — 

are related to the body field multivelocities 



a ,.1,, ._ d{y A o4>) 
1 MJ 0j • dx» 



by the chain rule as in (3A.3). ♦ 

Remark 3A.3. In the computational setting (see Leok [2004]) the two copies of 
spacetime correspond to a computational domain X (where the mesh is uniform) 
and a physical domain X (where the mesh is concentrated about regions of high 
gradients), and the discrete variational principle allows one to obtain both the 
discrete diffeomorphism 77 of spacetime (corresponding to the distribution of 
nodal points in spacetime), and the field values over these points. This is unlike 
moving mesh methods, where the solution of the field values is separate from 
the choice of the distribution of the nodal points. ♦ 

Our first goal is to obtain a field theory which is diffeomorphism covariant. 
Our construction achieves this, as we now show. 

Let a G DifT(X). On sections tp of Y we define the action 

a • tp = oy o tp o er -1 . (3A.4) 

We extend this to an imbedding of Diff(A) — > Aut(y) by requiring that Diff(A) 
act trivially on X; this is consistent with viewing 77 as a point mapping of X to 
X. Thus we have 

a ■ rj = rj o cr _1 . (3A.5) 

Theorem 3A.4. The modified field theory on J k + r Y with Lagrangian density 
(3A.1) is DiS(X)-covariant. 

Proof. Suppose a G Diff(A). Then by construction and (3A.5), (3A.1) yields 

I (j k+r v Y (j k+r (v, *)(*))) = -V,o-- 0)(a(.x))) 

= (a ■ r,)* [JO {f((a ■ V ) Y ° (<r ■ <f>) ° (a ■ r,)- 1 )^)))} 
= (i]o cr- 1 )* [L (f(rj Y o4>o rf x ){r]{x)))] 
= ( jJjZ{j k+r ( v ,^)(x)) 
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as was to be shown. 



It follows that Diff(A) sends solutions of the variational problem defined by 
L to solutions. Of course, the same need not be true for £. 

Our second goal is to ensure that the modified field theory is "essentially 
equivalent" to the original one. This is also the case, as we now explain. 

We first consider the Euler-Lagrange equations of the new Lagrangian den- 
sity L corresponding to the fields cb. For this, we compute the variation of the 
action integral for compactly supported vertical variations (77, cj) e ) with <fio = cb. 
In this case (3A.1) gives 



rt_ 

de 



d_ 

de 

ti- 
de 



=0 J x 



=0 J x 



by the change of variables formula. Since r\y is fiber-preserving it is clear that 



> 77 1 is a compactly supported vertical variation of j]y 1 



1077 



In fact, 



as 7iy is an automorphism of Y , any compactly supported vertical variation of 
r/Y (f> V 1 can be obtained in this way. Hence, the derivative above vanishes 
if and only if 77Y o cb o 77" 1 is a critical section of L. 

Now suppose cb is fixed and consider compactly supported variations 77 e of 
the covariance field 77. 8 Then, reprising the previous calculation, 



d_ 

de 



=oJx 



d_ 

de 



=oJx 



(Note that (77 e )y o cb o 77 e _1 can be thought as a "horizontal variation" of the 
section 7/y ocbor]" 1 , cf. Marsden et al. [1998].) To compute this integral, let 0£ 
be a Lepagean equivalent of L on J 2r ~ 1 Y (see Gotay [1991]). As the Lagrangian 
density is the 0-contact part of 

£ (fttVefr ° ^ o t;,- 1 )) = (j 2 ''- 1 ((77 e )y o o t^ 1 ))* 6 £ 

= (vr 1 TU 2r - 1 ^T(j 2r - 1 (ve)YyeL. 

Applying the change of variables formula, the derivative of the action now reads 

-V)*(.7 2r - 1 77r)*^-- 1 v V ©£ 



d 
1c 



Cj 3r -V)*(7 ar - 1 (%)y)*e ( c= / (f 

=qJX JX 



We must vary r) within Diff(X). 
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where £ denotes the Lie derivative and W is the vector field on Y defined by the 
flow (^e)y- Undoing the change of variables and using Cartan's magic formula, 
this becomes 

(j 2r-1 (f?y ° 4> ° 'T 1 ))* ij2r-i w d<d L + / (j 2r ~ 1 (r/y o0o7/~ 1 ))*(ii i 2r-i w/ 0£. 
x Jx 

By Stokes' theorem the second term vanishes for compactly supported W . But 

if tjy o (f) o 77" 1 is a critical section of L (so that the Euler-Lagrange equations 

of L with respect to <f> are satisfied) then the first term in the above expression 

vanishes (see Krupka et al. [2010]). We have thus proven: 

Theorem 3A.5. Let (77, </>) be a section ofY. Then 

(i) (77, 4>) satisfies the Euler-Lagrange equations of X with respect to (j) if and 
only if n ■ (f> satisfies the Euler-Lagrange equations of L, that is, 

— (77,0) = ^ — (v4>) = o 

(ii) Lf (77, </>) satisfies the Euler-Lagrange equations of L with respect to <j> 
then the Euler-Lagrange equations of L with respect to the covariance 
fields n are vacuously satisfied for (r],4>), that is, 

^(r?,0)= O=^(77,0)=O. 
d(f) on 

If § and § denote the sets of solutions of the Euler-Lagrange equations of L 
and £ respectively, Theorem 3A.5 can be summarized by saying that the map 
(77, 4>) i->- (77, 77 • (f>) establishes a one-to-one correspondence 

S -> Diff(JT) x S. 

Furthermore, since L is Diff(X)-covariant, Diff(X) acts on §. We may then 
use the surjection (77, <f) 1— > 77 • cf> to identify § with the orbit space §/Diff(X). 
Thus while the covariantized theory will have many more solutions than the 
original system, corresponding to the (gauge) freedom in the choice of covariance 
fields, their solutions spaces stand in the simple relation 

S w S/DiffpO. 

Because of this we say that the covariantization process leaves solution spaces 
"essentially unchanged." As the covariance fields have no dynamics, we may 
additionally say that the original and covariantized theories are "essentially 
equivalent." 
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Remark 3A.6. Theorem 3A.5 provides, to our knowledge, the first instance 
in which the essential equivalence of the original and covariantized systems has 
been explicitly stated and proved. ♦ 



Remark 3 A. 7. The situation above amounts to a reduction process when 
viewed in reverse. More precisely, if we begin with a Diff (AT)-covariant La- 
grangian density L, reduction by the Diff (X)-action would yield the Lagrangian 
density L. Indeed, the behavior of the set of solutions § and S given just above 
is simply the relation between the set of solutions of a Diff (A)-covariant La- 
grangian and that of its reduced counterpart. ♦ 

It is worthwhile explicitly writing out the Eulcr-Lagrange equations for L 
in the special case when k = and r = 1. These equations for the fields </> 
unsurprisingly turn out to be reparametrizations of those for L. (We give an 
example in §3A.l.) The Eulcr-Lagrange equations for the covariancc fields arc 
more interesting; we may express them as follows. 

First we compute 

dL d(detJ) dL . , . 
dx a ^ dx a ^ dx a ^ ' 



d {x%C c ) dL dy A 
<9x% dy A c dx a f_ 



L^^ + ^^(detJ). (3A.6) 



In obtaining the first term here we have employed a cofactor expansion of dot J 
(in which is the cofactor of x c v and hatted indices are not summed). Now 

^lla^ = C ° S "l> = C^Wc) = ^ Q (det J). 

In the second term we use (3A.3) to evaluate 



(V A BV B , + V A „)^ = (V A BV B , + V A „) {-X V aX» c ) = -y A a X» c 



Substituting these results into (3A.6), we have 
dL / „ dL 



= [LS c a - 1 ^y A a )x^ c (detJ). 



dx a M \ dy 
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The Euler-Lagrange equations for the covariance helds may thus be written 
' dL \ 



dL(dct J) 
dx a 

dL 

dx a 

dL 

dx" 



dx a . 



x\D b 



LS c a 



dL 



dy 



v A 



: (det J) 



D, 



D, 



dL 



( dL 



i y a - L5° a 



(det J) 



(3A.7) 

(3A.8) 
(3A.9) 



V a + 



dL 

dy A t 



\ lb -D a L) (det J) 



In going from (3A.8) to (3A.9), we have used the "Piola identity" : D^(x^ c det J) = 
x b l _ l ,Db(x ,J ' c det J) = 0. We recast the first factor in this last expression in the 
form 



SL 



5y 



dL dL 



dx a dy- 



:V a + 



dL 
dy A i 



y A ab - D a L 



The term in parentheses vanishes identically by definition of the total derivative, 
whence we again obtain (ii) of Theorem 3A.5. 

However, referring to (3A.9) we see that the Euler-Lagrange equations with 
respect to r\ also take the form 

dL 



dx a 



- D b t° a = 



(3A.10) 



where 



i c a = L5 c a - 



dL 

dy / 



-y a 



is the canonical stress-energy- momentum ( "SEM" ) tensor density of L . Should 
L be independent of the coordinates x a these reduce to conservation of energy- 
momentum in the spatial representation. 

Remark 3A.8. By analogy with elasticity theory (see p. 7, Marsden and Hughes 
[1983]), we may define the two-point canonical Piola-Kirchhoff tensor density p 
to be the Piola transform of t: 

P^a=t C a^c(detJ). 

Then from (3A.7) we may equally well express (3A.10) as 

D^ a = 0. 



dL 

dx a 



When the covariance fields are ignorablc, these yield energy-momentum conser- 
vation in the material picture. ♦ 
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Thus we have achieved our objective in the case when all fields are varia- 
tional: we have taken a non-diffeomorphism covariant system and produced an 
essentially equivalent theory which is Diff (AT)-covariant. We now study some 
examples. 

3A.1 Example: Nonrelativistic Mechanics 

Issues of general covariance appear even in the most elementary systems, such 
as mechanics. For instance, consider a particle moving in a Riemannian config- 
uration space (Q,g), so that Y = R x Q. Supposing that the action of Diff(R) 
on R x Q is trivial on the second factor (as is typically the case in mechanics), 
the usual Lagrangian density 

L = {^mg AB q A q B - V(q)jdt (3A.11) 

is not time reparametrization covariant. 

To repair this we introduce a covariance field T : R — > R. 9 As the time 
reparametrization group does not act on the configuration space, (3A.2) simpli- 
fies considerably: 

L(t,T,q A ,f,q A ) = L(T,q A ,q A f- 1 )f 
where T := dT/dt, and so (3A.11) is replaced by 

I = {^mg AB q A q B f- 2 - V(q)jfdt. 

It is obvious that L is time reparametrization invariant, as guaranteed by The- 
orem 3A.4. 

Next we verify (i) of Theorem 3A.5 for first order mechanics in general. The 
Eulcr-Lagrange equations for L with respect to q A (t) are 

<9L d ( di \ _ dL ■ d f dL 

U T: T i . I TTTT I "7^ T -L ~7T 



dq A dt \ dq A I dq A dt \ dq A 



Since T is a diffcomorphism, we may use it as a parameter on R; then this 
becomes 

= t 



dL d ( dL 



A dT \d(dq A /dT) 



9 The introduction of the covariance field T is a venerable technique in mechanics, see §V.6 
in Lanczos [1970]. However, our point of view here is "active," whereas the classical approach 
is "passive." 
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Since T ^ it follows that the Euler-Lagrange equations of L are a time 
reparametrization of those of L for q A {T). 

As for the Euler-Lagrange equations of L with respect to T, we find that 
(3A.10) reduces to 

dL _ dE 

where E = — t°o is the energy of the original system. When the mechanical 
system is scleronomic, this gives conservation of energy 

3A.2 Example: The Klein Gordon Equation, Version I 

We first covariantize the Klein-Gordon theory using the Lagrangian density 
(1.1). Write the covariance field rj : R 2 -> R 2 as r}(t,x) = (T(t,x),X(t,x)). 
Since <j> is a scalar field, the action of Diff (R 2 ) on Y = M x R 2 is trivial. Then 
according to (3A.2) and (3A.3), in passing from L to L we must replace <J) t t and 
4>,x by 

<j>x' - 4>'x 



and 



t>,tV*T + (t>,xV X T 



TX' - T'X 
-4>T' + fit 



TX' - T'X 

respectively, where we have availed ourselves of Cramer's Rule and have denoted 
derivatives w.r.t t by dots and x by primes. (Observe in this regard that the 
Jacobian J = TX' — T'X.) Thus the covariantized Lagrangian is 

L = <V*(*r + tx')-<ptx-Prx> i TX , _ T , x 

TX' - T'X 2 

Of course, when r\ = idR2 this reduces to (1.1). A calculation shows that L does 
indeed have the expected transformation properties under Diff(R 2 ). 

We will give an alternate derivation of this (rather complicated-looking) 
Lagrangian in §3B.l. 

3B The Case When Absolute Objects Are Present 

Suppose the field variables can be split into two types: <fi = (ip; x), where the tp 
are variational and the x are not. 10 (Think of the latter as being background 
metrics, or tetrads, or Yang-Mills fields, etc., on A.) We correspondingly split 



10 To emphasize this distinction we use a semicolon to separate the variational from the 
non- variational fields. 
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Y as V Xx B, where the factor V refers to the variational fields and B to the 
background ones. 

We further suppose that the following two Ansatze regarding the background 
fields hold. The \: (Al) appear only to zeroth order in L, and (A2) have 
differential index at most 1. These conditions are typically satisfied in practice, 
and we do not expect violations thereof to cause problems, see for instance 
Castrillon Lopez, Gotay and Marsden [2008a]. 

Now assume that the given Lagrangian density L is diffcomorphism covari- 
ant; our problem is to covariantize the theory by eliminating the absolute objects 
X from the ranks of fields. 

We accomplish this in a manner that is similar to the technique of §3A: 
We introduce a copy X of the spacetime X into the fiber of the variational 
part V of the configuration bundle Y and adjoin diffcomorphisms rj : X — > X 
as covariancc fields. Next, we view the x, which originally were sections of 
B — > X, as sections of B — > X. Thus these quantities are no longer regarded as 
fields — and hence are no longer absolute objects in the theory — but considered 
merely as geometric objects x anchored to the fiber of the modified configuration 
bundle V = (X x X) xx V. Finally, wc modify £ to get the new Lagrangian 
density L on J r V defined by 

l(j r M)) = L{f^,n- 1 -x) (3B.1) 

where we have taken Ansatz (Al) into account. Subject to Ansatz (A2), L will 
then remain r th -order. 

The first key observation is that the modified theory is also spacetime dif- 
feomorphism covariant: 

Theorem 3B.1. The Lagrangian density L is DiS(X)-covariant. 

Proof. This is a direct consequence of definition (3B.1) and the covariance 
assumption on L. Indeed, for a € Diff(A), 

£{fvv ■ fin, *)) = i (j V ■ V),f(a ■ v-)) 

= L(f(a-iP);(a-r,)- 1 -x) 
= L(f(a-i>);a-(r,- 1 - x )) 

= <7.(<C(j r fa,^))) 
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where we use (3A.4) and (3A.5) to compute (a ■ 77) 1 • \ = (77 o a x ) 1 • x = 
(a or]- 1 ) ■ x = cr ■ (ir 1 ■ X)- ■ 

Thus the modified theory is generally covariant. It remains only to verify 
that the solution spaces for the original and modified theories are essentially the 
same. The analogue of Theorem 3 A. 5 in the present context is 

Theorem 3B.2. Let (rj,ip) be a section of V and let % '■ X —5- B be given. 

(i) (77, iji) satisfies the Euler-Lagrange equations of L with respect to the 
fields ip if and only if (tp ]"q~ l ■ x) satisfies the Euler-Lagrange equations 
of L with respect to ip, that is, 



(ii) If(r],ip) satisfies the Euler-Lagrange equations of L with respect to the 
fields i\) then the Euler-Lagrange equations of £ with respect to the co- 
variance fields 77 are vacuously satisfied for (r),ip), that is, 



Proof. Part (i) follows immediately from the definition of L. 

For (ii) we consider a compactly supported variation rj e = F e o 77 of 77, where 
F t is a vertical flow on X x X — > X. We compute from (3B.1), Diff(A)- 
cquivariance, and the change of variables formula that 




^(7A;t7- 1 -x) = 0. 









(3B.2) 
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where the partial derivative here is evaluated along the section (j r (rj ■ ip) ; 
Write the infinitesimal generator of the induced flow (F £ ) g on B as 

£b = ? b — + r — 
Then from (3A.4) and the expansions 

(x°F e ) s *x s + ex s , b e and ((F_ e ) s o f « f - £ f 
to order e, we have that 

d(F_ € .x) s _ pb ps 
de -X- b * ~^ 

(This expression is the vertical part of the vector field £b with respect to the 
splitting T%(x)B = V%(x)B © im (Tsx) induced by jlx-) As x nas differential 
index < 1, we may write 

f = C S U b + C S \e,a 

for some functions C on B (see Gotay and Marsden [1992] for specifics). Plug- 
ging into (3B.2), we end up with 



d_ 

de 



&(f(ve,i>))= I ^(ix s M-c s b ]e-c sa b e.a)d 

-_ Jx Jxdx s \ > 



sa rb \ in+1- 



SL 
lxW s 

where we have used (Al) in going from the first line to the second. 

Now suppose the ip are on shell, so that by part (i) the pair (ip;r) ■ x) 
satisfies the Euler-Lagrange equations of L with respect to ip. We observe that 
since L is Diff(A)-covariant, the pair (rj-ip ;x) also satisfies the Euler-Lagrange 
equations of L with respect to ip (i.e., r\ is a symmetry). Bearing this in mind, 
let T be the SEM tensor density of L; then the generalized Hilbert formula (3.3) 
in Gotay and Marsden [1992] and Prop. 1 ibid give 

1 a b = -C sa b -^- and D a % a b = (x s , b - C s b ) |^ 
OX ox" 

along the section (j r (r) ■ip);x)i respectively. Thus the r.h.s. of the expression 

above reduces to 

J X J X 

= f D a (T\e)(detJ)d 
Jx 



n+1 x 
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where we have again used the change of variables formula. 
But now a computation along with the Piola identity yields 



D a (% a b ^ b ){detJ)=D^ b e) 

where y b = a^ a 1 a fc (det J) is the Piola-Kirchhoff SEM tensor density of L. 
Thus 



d_ 



= o J x Jx 



which vanishes by virtue of the divergence theorem, since by assumption £ (a;, x) 
is compactly supported as a function of x £ X. ■ 

Thus for given x the map (77, tp) i— >• (77, (tp ; 77 -1 ■ x)) provides an isomorphism 
of § with Diff (X) x S. 

3B.1 Example: The Klein Gordon Equation, Version II 

We covariantize the second version of the Klein-Gordon Lagrangian density 
(1.2). View the metric as being anchored to the fiber X, so that g^ v — 

x tJ/ a.x"bg ab , and take 

L = Cx\x\g ab <j>^,v ~ \^ 2 <P 2 ) (det J). 



Choosing coordinates on X in which 



9 



1 

1 



and again writing 77 = (T,X), one finds that L reduces to the Lagrangian of 
§3A.2. 

The case when a metric on spacetime is the only background field was ex- 
haustively analyzed in Castrillon Lopez, Gotay and Marsden [2008a]. There, it 
was shown that the vanishing of the Euler-Lagrange expressions for the covari- 
ance fields could be viewed as a consequence of the Bianchi identity V^T^^ = 
for the Hilbert SEM tensor density 1^" y = -2 SL / 'Sg^. 



4 Enlarging Vertical Automorphism Groups 

Now we consider the case when a group S acts vertically on the configuration 
bundle Y, that is, the elements of S arc 7rxy-bundlc automorphisms Y — > Y 
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covering the identity on X. It can happen that 9 is not an invariance group of 
the Lagrangian density £ even though, for instance, (i) 9 acts by symmetries, 
or (u) some subgroup is an invariance group. Thus it may be desirable to find 
another essentially equivalent field theory for which 9 is an invariance group. 

4A The Construction 

As with diffcomorphism covariance, we will define a new bundle Y — > X and a 
new Lagrangian density L admitting 9 as an invariance group. For this we make 
the additional assumption that 9 can be identified with the set of sections of 
certain bundle E — > X, so that if r\ £ 9 then r\(x) £ E x C Diff(l^). (This is the 
case, in particular, in gauge theory; see §4B.3. In other situations, one may wish 
to identify 9 with some subset of sections of E.) Mimicking the constructions 
in the previous section we achieve this by setting 11 

Y = E x x Y 

and 

L(j k+r ( V ^)(x))=L(f( VY o^(x)). (4A.1) 

Thus sections rj of E will play the role of covariance fields in the present context. 
Now the group 9 acts on the new bundle in a natural way by 

ay((r,(x),<P(x))) = (7K*)° CT (*rVy(#c))) (4A.2) 

for any (r)(x),4>(x)) £ Y and section a of E — > X (compare (3A.5)). With 
respect to this action we immediately verify, in the manner of Theorem 3A.4, 
that the modified Lagrangian density is invariant under this 9-action: for all 
a £ 9 and 7 £ J k+r Y, 

Note that, in contrast to (2.4), there is no push-forward on the r.h.s. as the 
group action is vertical. (Thus we speak of mvariance in this context, rather 
than covariance.) 

Following the now familiar pattern, we find that the Euler-Lagrange equa- 
tions of L with respect to the fields <j> are essentially the same as those of the 
original Lagrangian L. With respect to the new variables rj, no more conditions 
are added, that is, they do not contribute to the dynamics of the variational 

11 Analogous to §3, we here take k to be the order of the highest derivative of r] £ S that 
appears in rjy o <j>, where r\ is considered a a section X — > E. 
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problem. In fact, Theorem 3A.5 holds word-for-word in this context, and the 
proof is also similar, but simpler, as rjx = idx- 

4B Examples 

We provide several illustrations of our technique, beginning with an example of 
circumstance (?) above. 

4B.1 Abelian Chern Simons Theory 

Treating the connection A as an ordinary 1-form on a 3-manifold X, the La- 
grangian density is 

Li^A) =dAAA. 

The additive group C°°(X), thought of as sections / of X x M — >• X, acts 
on T*X according to 

f T *x(A x ) = A x + df(x). (4B.1) 

for A x £ T*X. The Lagrangian density is not invariant under this action; 
nonetheless, (4B.1) is a symmetry: if A is a solution (i.e., dA = 0), any fr*x-A = 
A + df will be as well. So it is natural to search for an equivalent field theory 
which is C°°(X)-invariant. 

Our constructions produce the modified configuration bundle 

Y = (X x R) x x T*X 

and the (first order) Lagrangian density 

A)) =dAA{A + dri). 

As expected, this is invariant under the action (4A.2) which here takes the form 

fy( V ,A) = (r 1 -f,A + df). (4B.2) 

The Euler-Lagrange equation for A is now d(2A + drj) = <i=^ dA = and 
that for rj is vacuously satisfied, consistent with our results and the fact that 
C CC (X) is a symmetry group of the original system. 

Remark 4B.1. All this can be generalized to the case when A is regarded as 
a connection on a trivial principal G- bundle over X, with G not necessarily 
Abelian. The Lagrangian density becomes 

L = tr ( A A Q A - -A A [A, A]) 
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where S1 A = dA + [A, A] is the curvature. For non-trivial bundles, a variational 
theory can be given in terms of the bundle of connections as the configuration 
bundle Y and the adjoint bundle Ad(G) as the bundle E whose sections are sym- 
metries. See §4B.3 following, and Castrillon Lopez and Muhoz Masque [2001] 
and Tejero Prieto [2004] for more details. For higher dimensional manifolds X, 
the situation is more complex. In any case, good references are Freed [1995] and 
Freed [2009]. ♦ 

4B.2 The Stuckelberg Trick 

We consider the Proca field A, a 1-form on 4-dimensional spacetime (X, g), with 
Lagrangian density L : J 1 (T*X) A 4 X being 

LtfA) = \{- \\dA\\ 2 + m 2 \\A\\ 2 ) ^drtg~d 4 x. (4B.3) 

Here the norms are taken with respect to the metric g. The Proca equations 
are * d* dA = m 2 A where * denotes the Hodge star operator of g. 

Thinking of the Proca field as being electromagnetism but with a massive 
photon, we see that the mass term in L breaks the electromagnetic gauge in- 
variance A h-» A + df. We may restore this invariance via our construction, 
even though these electromagnetic shifts are not even symmetries of the Proca 
equations. 

Formally, everything works as in the Chern-Simons example. We have 

£{]\il,A)) = i( - ||gL4|| 2 +m 2 \\A + dr,\\ 2 ) y/=fetg~d 4 x 

which is obviously invariant under (4B.2). This is the "Stuckelberg Lagrangian" 
and the passage from the Proca Lagrangian to it is known as the "Stuckelberg 
trick" Stuckelberg [1957]. The Euler-Lagrange equations for A are now 

-kd-kdA = m 2 (A + drj) 

and, using these, we find that the field equation d * (A + drj) = for the 
"Stuckelberg scalar" rj is identically satisfied, as expected. 

Although the Stuckelberg trick seems innocuous enough, the correspond- 
ing field theories are structurally quite different: the Stuckelberg theory is 
purely first class (in the sense of Dirac; see Gotay and Marsdcn [2010]) while the 
Proca theory is purely second class. (Indeed, the historical importance of the 
Stuckelberg trick was to effect this change, cf. Banerjee, Banerjee and Ghosh 
[1995].) Despite this, the two theories are essentially equivalent as defined above. 
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4B.3 The Minimal Coupling (or Utiyama) Construction 

The main instance of circumstance (ii) above is gauge theory. In this context, 
we have a field theory invariant under the action of a finite-dimensional Lie 
group G which we wish to 'gauge,' as we now explain. 

Let P — > X be a principal G-bundle. Gauge transformations are vertical 
diffcomorphisms $ : P — > P such that R g o $ = $ o R g for any g G G, where 
R g stands for the right action of G on P. These transformations can be seen 
as sections of the Adjoint bundle Ad(G) — > X; this is the bundle associated 
to P with respect to the conjugate action of G onto itself. More precisely, 
Ad(G) = (P x G)/G where the action is (p,h) ■ g = {R g {p),g~ 1 hg), for any 
(p, h) G P x G, g G G. Let (p,g)G stand for the class of (p, g) in Ad(G). Given 
a section 77 of Ad(G), the mapping $ ?) : P — > P defined as 

$ v (p)=R g (p), where t)(tt X p(p)) = (p, flOc 

is readily seen to be equivariant and vertical, that is, a gauge transformation. 
Moreover, any gauge transformation can be identified with a section of the 
Adjoint bundle via the equation above. We denote the collection of all sections 
of Ad(G) — > X by S; it is the gauge group. 

Let V be a vector space on which G acts linearly and Y = (P x V)/G be 
the associated vector bundle. Then G acts Y on the left by vertical bundle 
automorphisms according to 

9 ■ (P, v)g = (p,g- v)g = (Rg-ip, v) G - 

Consequently, S also acts on Y according to 77 • (p, v)g = ($77 (p), v)g- Thus we 
have gauged the action of G on V: we have extended the 'global' (i.e., base- 
independent) action of the (finite-dimensional) group G by that of the 'local' 
(i.e., base-dependent) action of the (generally infinite-dimensional) group 9- 

The situation encountered in gauge theory is when the associated vector 
bundle Y — > X is the configuration bundle of some field theory with Lagrangian 
density L. Typically L is G-invariant, but not S-invariant. However, one re- 
quires invariance under S for physical reasons; we now apply our prescription 
to achieve this. Thus we take 

Y = Ad(G) x x Y 
and define the modified Lagrangian density L by (4A.1). 
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We refer to a local trivialization U x G of Ad(G) — > X to obtain a clearer 
understanding of this construction when r = 1, fc = 0. Using matrix notation 
and the Leibniz rule, from (4A.1) we obtain the (first order) Lagrangian 

L(x», V A b,V A b„ y A , y A ») = L(x^ V A B y B , V A B y B , + V A Bfl y B ) 

= L{x», v A B y B , v a b [y\ + (v- v ) b dv D c,y c ] ) 

where we have isolated a factor of r\ A b in the last argument. Now by assumption, 
L is G- invariant: 

L(x», V A By B ,V A By\) = L(x^y A , y A ,) . 

Applying this to the r.h.s. of the preceding equation gives 

L(x»,r, A B ,r) A Bfl , y A ,y A J = L(x»,y A , y A „ + fr" 1 ) Wc>2/ C ) ■ (4B.4) 

Note that (t]~ 1 ) A dV D C/j, are the components of the matrix ry -1 ^, and as dr\ is 
pointwise an element of the bundle T*U®TG, this expression is nothing but the 
identification of drj as belonging to T*U <g> g corresponding to the identification 
TG = G x g given by (g, g) >-> (g, TL g -i ■ g), where g is the Lie algebra of G. If 
we write A for the variables oi T*U <E) 0, we have finally 

L(x^,7 1 A B ,7 1 A Bfl ,y A ,y A tl ) =L(x^,y A ,y A ^ + A A B y B ). 

On the other hand, the first jet bundle of U x G -> U is T*C7 x TG -> 
t/ xG-ff. Thus with the identification above we can write 

J^C/ x G) = G x (T*[/<g> g). 

At this point we observe that even though the modified Lagrangian density £ 
was initially defined (with respect to the given local trivialization) on the jet 
space 

J X Y\ U = J^Y\ U x (T*U ® TG) = J*Y\ U xGx (T*U ® g), 

it has no explicit dependence on 77. We can thus drop the factor of G in the 
fiber on which L is defined and simply view 

£ : J X Y\U x (T*U® g) -> A n+1 C/. (4B.5) 

We now notice that: 
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1. The bundle T*U ®q is the bundle of connections of the principal bundle 
P\U = U x G of which Y\U = U x V is an associated vector bundle. 



2. The expression V M (/) A = <j> A , l i + {ri 1 ) A dV D c,^ is the covariant deriva- 
tive of the field <j> with respect to the connection A = rj^ 1 drj. 

3. From (4B.4) the variational equations SL/5<fi B (77, 0) = can be written 

dL_ ^ ( _dL 



- V M^Tb)=0. (4B.6) 



This is the well-known minimal coupling (or Utiyama) trick. Our prescription 
thus transforms the G-invariant Lagrangian density L into a 9-invariant La- 
grangian density £ by changing partial derivatives into covariant derivatives 
with respect to a connection. This connection A (and not the field rf) is the 
new field of the modified Lagrangian according to the classical gauge-theoretic 
prescription, while (4B.6) are recognized as the Euler-Lagrangc equations (for 
the matter fields) in gauge theory. 
Some remarks arc in order: 

Remark 4B.2. We have seen in §4A that the Euler-Lagrangc equations of L 
for the fields (77, 0) are equivalent to the Euler-Lagrange equations of £ for 77 • 0. 
This is of course still true. But if instead we consider the fields on which the 
modified Lagrangian density depends to be A and cj>, this result no longer holds. 
Indeed, the variational equation with respect to A forces 

d~L dL ^ 



dA^ d<f> A 4l r 

which is in general inconsistent with the dynamics of the original variational 
problem (compare (1.3)). This is why the minimal coupling trick requires the 
addition of a source term depending on the connection A and its first derivatives 
to the Lagrangian density. This term must be gauge-invariant; while the Yang- 
Mills Lagrangian density is the usual choice, a result of Utiyama [1956] (see also 
Garcia Perez [1977]) provides a characterization of all such Lagrangians in term 
of the curvature of the connection. 

On the other hand, it is amusing to wonder what would happen if we treated 
A as simply a background field, and then generally covariantized the theory as 
in §3B? 'Second covariantization' ? ♦ 
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Remark 4B.3. Connections A of the type r/^ 1 dr\ are flat. Thus the prescription 
we provide here is the minimal coupling trick where the new field (the connec- 
tion) has no curvature. As the curvature is understood as the field strength 
of A, this explains physically why for these connections, the Euler-Lagrange 
equations of the covariantizcd theory will be equivalent to those of the original 
theory: a gauge transformation can be used to reduce A = 0. ♦ 

Remark 4B.4. The condition for A (as a section of T*U ®q — > U) to be flat is 
equivalent to the covariance field r\ being a holonomic section of J 1 Ad(G)| U = 
T*U ® TG — G x (T*U $5 g). Hence we could say that the classical minimal 
coupling trick just takes the construction of §4A and assumes that the connec- 
tion A need not be holonomic. This can be seen, somehow, as natural because 
the new Lagrangian density L depends only on A, not on 77, and hence we could 
'forget' that A is induced by the holonomic section 77. ♦ 

5 The General Case 

Finally, suppose the group of ttxy- bundle automorphisms under consideration 
acts neither 'horizontally' (as in §2) nor purely vertically on Y (as in §3). For 
instance, consider a field theory based on a trivial principal bundle lxG->X, 
which is not generally covariant and which we wish to gauge. Since Diff (X) acts 
on X x G and both it and the gauge group C°° (X, G) can be realized as sections 
of bundles over X, we may straightforwardly extend our technique to this case, 
regarding the group S = Diff(X) x C° C (X, G) as sections ofXxXxG—>X, 
and then covariantizing with respect to 9- 

The case when the relevant group 9 cannot be realized as sections of some 
bundle over X is more complicated; nonetheless, our constructions still work 
with the following adjustments. As in Castrillon Lopez, Gotay and Marsden 
[2008b] we "concatenate" Y x Y — > Y and Y — > X to obtain the modified 
configuration bundle Y x Y xx Y — > Y xx X = Y. Sections of this bundle 
will consist of pairs (rj,<p), where 7/ £ 9 is a 7rxr-bundlc automorphism and 
the "fields" <fr : Y — > Y are related to the original fields by <j> = (f) o ttxy • The 
automorphisms r) : Y — » Y are the covariance fields in this approach. 

As the modified Lagrangian density on J r+k (Y x Y Xx Y) we take 

£{f +k (r), 4>)(y)) = Vx* [C{f(v ■ Mvxfr)))] S N (y - <t>(x)) 
where x = ttxy{u) and N is the fiber dimension of Y — > X. As the delta 
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function is a scalar density of weight 1 along the fibers of Y — > X, and £ is is a 
scalar density of weight 1 on X, L is a scalar density of weight Ion 7. These 
observations coupled with a computation along the lines of that contained in 
the proof of Theorem 3A.4 shows that £ is S-covariant. 

Finally, Theorem 3 A. 5 remains valid as stated, the proof needing only minor 
modifications. 

Acknowledgments. We would like to thank Jair Koillcr, Mclvin 
Leok, Jerry Marsden, Peter Michor, Joris Vankerschaver and Hiroaki Yoshimura 
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